Tensor decision trees for continual learning from drifting data streams
نویسندگان
چکیده
Data stream classification is one of the most vital areas contemporary machine learning, as many real-life problems generate data continuously and in large volumes. However, research this area focuses on vector-based representations, which are unsuitable for capturing properties more complex multi-dimensional structures, such images video sequences. In paper, we propose a novel methodology learning adaptive decision trees from streams tensors. We introduce Chordal Kernel Decision Tree continual tensor streams. order to maintain characteristics, train update classifiers kernel space designed work with representation. use chordal distance compute similarities between tensors then apply it new feature trained. This allows direct tree induction accommodate streaming drifting nature data, concept drift detection scheme based It us reconstruct every time when change detected. The proposed approach fast efficient Experimental study, conducted 4 real-world 52 artificial large-scale streams, shows that using native leads accurate than outperforms vectorized representations.
منابع مشابه
Cost-Sensitive Perceptron Decision Trees for Imbalanced Drifting Data Streams
Mining streaming and drifting data is among the most popular contemporary applications of machine learning methods. Due to the potentially unbounded number of instances arriving rapidly, evolving concepts and limitations imposed on utilized computational resources, there is a need to develop efficient and adaptive algorithms that can handle such problems. These learning difficulties can be furt...
متن کاملConcept Drift in Decision Trees Learning from Data Streams
This paper presents the Ultra Fast Forest of Trees (UFFT) system. It is an incremental algorithm that works online, processing each example in constant time, and performing a single scan over the training examples. The system has been designed for numerical data. It uses analytical techniques to choose the splitting criteria, and the information gain to estimate the merit of each possible split...
متن کاملLearning from Concept Drifting Data Streams with Unlabeled Data
Contrary to the previous beliefs that all arrived streaming data are labeled and the class labels are immediately available, we propose a Semi-supervised classification algorithm for data streams with concept drifts and UNlabeled data, called SUN. SUN is based on an evolved decision tree. In terms of deviation between history concept clusters and new ones generated by a developed clustering alg...
متن کاملLearning in Dynamic Environments: Decision Trees for Data Streams
This paper presents an adaptive learning system for induction of forest of trees from data streams able to detect Concept Drift. We have extended our previous work on Ultra Fast Forest Trees (UFFT) with the ability to detect concept drift in the distribution of the examples. The Ultra Fast Forest of Trees is an incremental algorithm, that works online, processing each example in constant time, ...
متن کاملLearning to Rank from Concept-Drifting Network Data Streams
Networked data are, nowadays, collected in various application domains such as social networks, biological networks, sensor networks, spatial networks, peer-to-peer networks etc. Recently, the application of data stream mining to networked data, in order to study their evolution over time, is receiving increasing attention in the research community. Following this main stream of research, we pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Machine Learning
سال: 2021
ISSN: ['0885-6125', '1573-0565']
DOI: https://doi.org/10.1007/s10994-021-06054-y